Agent Skills: Automatic Speech Recognition (ASR)

Transcribe audio segments to text using Whisper models. Use larger models (small, base, medium, large-v3) for better accuracy, or faster-whisper for optimized performance. Always align transcription timestamps with diarization segments for accurate speaker-labeled subtitles.

UncategorizedID: benchflow-ai/skillsbench/Automatic Speech Recognition (ASR)

Author

benchflow-ai

https://github.com/benchflow-ai View all skills

Repository

benchflow-ai/skillsbench

benchflow-aiLicense: Apache-2.0

894231

Install this agent skill to your local

pnpm dlx add-skill https://github.com/benchflow-ai/skillsbench/Automatic Speech Recognition (ASR)

Skill Files

Browse the full folder contents for Automatic Speech Recognition (ASR).

Download Skill

Loading file tree…

Select a file to preview its contents.